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Abstract 

Symbolic ultrametrics define edge-colored complete graphs Kn and 
yield a simple tree representation of K^- We discuss, under which condi¬ 
tions this idea can be generalized to find a symbolic ultrametric that, in 
addition, distinguishes between edges and non-edges of arbitrary graphs 
G = {V, E) and thus, yielding a simple tree representation of G. We 
prove that such a symbolic ultrametric can only be defined for G if and 
only if G is a so-called cograph. A cograph is uniquely determined by 
a so-called cotree. As not all graphs are cographs, we ask, furthermore, 
what is the minimum number of cotrees needed to represent the topology 
of G. The latter problem is equivalent to find an optimal cograph edge 
^-decomposition {Ei,..., Ek} of E so that each subgraph {V, Ei) of G is 
a cograph. An upper bound for the integer k is derived and it is shown 
that determining whether a graph has a cograph 2-decomposition, resp., 
2-partition is NP-complete. 


1 Introduction 

Given an arbitrary edge-colored complete graph Kn = (U E) on n vertices, 
Bocker and Dress [4] asked, whether there is a tree representation of this 
i.e., a rooted tree T = {W,F) with leaf set V together with a labeling t of 
the non-leaf vertices in W \ E so that the least common ancestor lca(a;, y) of 
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distinct leaves x and y is labeled with the respective color of the edge [a;, y] S E. 
The pair (T, t) is then called symbolic representation of the edge-colored graph 
Kn- The authors showed, that there is a symbolic representation (T, t) if and 
only if the map i5 that assigns colors or symbols to the edges in E fulfills the 
properties of a so-called symbolic ultrametric [3] . Such maps are crucial for the 
characterization of relationships between genes or proteins, so-called orthology 
relations [laiiaiis], that lie at the heart of many phylogenomic studies. 

Inspired by the work of Bocker and Dress, we address the following problem: 
Does there exist, for an arbitrary undirected graph G = {V,E) a symbolic 
ultrametric 6 : V x V ^ M and thus, a symbolic representation (T, t) of G so 
that one can distinguish between edges and non-edges of G? In other words, we 
ask for a coloring S of the edges [x, y] S E, as well as the non-edges [a;, y] ^ E, 
so that the topology of G can be displayed by a rooted vertex-labeled tree 
{T,t) s.t. for all distinct vertices x,y € V the labeling of the lowest common 
ancestor lca(a;, y) is equal to S{x, y). The Hrst result of this contribution provides 
that such a symbolic ultrametric can only be defined for G if and only if G 
is a cograph. This, in particular, establishes another new characterization of 
cographs. 

Cographs are characterized by the absence of induced paths P 4 on four ver¬ 
tices. Moreover, Lerchs [T1II3 showed that each cograph G = (D, E) is asso¬ 
ciated with a unique rooted tree T{G), called cotree. Obviously, not all graphs 
are cographs and thus, don’t have a cotree representation. Therefore, we ask 
for the minimum number of cotrees that are needed to represent the structure 
of a given graph G = {V, E) in an unambiguous way. As it will turn out, this 
problem is equivalent to find a decomposition 11 = {Gi, ... ,Ek} oi E (the ele¬ 
ments of n need not necessarily be disjoint) for the least integer fc, so that each 
subgraph Gi = {V,Ei), 1 < i < fc is a cograph. Such a decomposition is called 
cograph edge fc-decomposition, or cograph fc-decomposition, for short. If the 
elements of 11 are in addition pairwise disjoint, we call 11 a cograph A:-partition. 
We will prove that finding the least integer k > 2 so that G has a cograph 
fc-decomposition or a cograph fc-partition is an NP-hard problem. Moreover, 
upper bounds for the integer fc for any cograph fc-decomposition are derived. 
These findings complement results known about so-called cograph vertex parti¬ 
tions [Tl[TIlll0l[26]. 

2 Basic Definitions 

Graph. In what follows, we consider undirected simple graphs G = (V, E) 
with vertex set V{G) = V and edge set E{G) = E C (^). The complement 
graph G^ = (P,G^) of G = (I/,G), has edge set E^ = (^) \ E. The graph 
K\v\ = (P, E) with E = (^) is called complete graph. A graph El = (W, E) 
is an induced subgraph of G = {V,E), ii W C V and all edges [x,y] € E with 
x,y £W are contained in E. The degree deg(r:) = \{e € E \ v € e}\ oi a vertex 
r; £ y is defined as the number of edges that contain v. The maximum degree 
of a graph is denoted with A. 
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Rooted Tree. A connected graph T is a tree, if T does not contain cycles. A 
vertex of a tree T of degree one is called a leaf of T and all other vertices of T are 
called inner vertices. The set of inner vertices of T is denoted by V^. A rooted 
tree T = {V, E) is a tree that contains a distinguished vertex pT & V called 
the root. The first inner vertex lcaT(a;, y) that lies on both unique paths from 
distinct leaves x, resp., y to the root, is called most recent common ancestor of 
X and y. If there is no danger of ambiguity, we will write lca(a;,y) rather then 
lcaT(a;,y). 

Symbolic Ultrametric and Symbolic Representation. In what follows, the set M 
will always denote a non-empty finite set, the symbol O will always denote a 
special element not contained in M, and M® := M Li {©}. Now, suppose X is 
an arbitrary non-empty set and S : X x X ^ M® a map. We call 6 a symbolic 
ultrametric if it satisfies the following conditions: 

(UO) S{x, y) = 0 if and only if a; = y; 

(Ul) 5{x,y) = 5{y,x) for all x,y G X, i.e. S is symmetric; 

(U2) |{(5(a;, y), 6(x, z), 5{y, 2 )}| < 2 for all x,y, z G X- and 

(U3) there exists no subset {x,y,u,v} G ('^) such that 5{x,y) = 5{y,u) = 
5{u,v) ^ 5{y,v) = 6{x,v) = S{x,u). 

Now, suppose that T = (V, E) is a rooted tree with leaf set X and that t : V ^ 
M® is a map such that t{x) = © for all x G X. To the pair {T,t) we associate 
the map d(T,t) on A x A by setting, for all x,y G X, 

d(T,t) : A X A ^ M®;d(^T,t){x,y) = t{lca,T{x,y)). (1) 

Clearly this map is symmetric and satisfies (UO). We call the pair {T,t) a 
symbolic representation of a map 5 : A x A —)• M®, if S{x,y) = d(^T,t)ix,y) 
holds for all x,y G X. For a subset W C A x A we denote with 5{W) the 
restriction of 5 to the set W. 

Cographs and Cotrees. Complement-reducible graph, cographs for short, are 
defined as the class of graphs formed from a single vertex under the closure of the 
operations of union and complementation, namely: (i) a single-vertex graph is a 
cograph; (ii) the disjoint union of cographs is a cograph; (iii) the complement of a 
cograph is a cograph. Alternatively, a cograph can be defined as a P 4 -free graph 
(i.e. a graph such that no four vertices induce a subgraph that is a path of length 
3), although there are a number of equivalent characterizations of such graphs 
(see e.g. [6] for a survey). It is well-known in the literature concerning cographs 
that, to any cograph G, one can associate a canonical cotree T{G) = {V,E). 
This is a rooted tree, leaf set equal to the vertex set V(G) of G and inner vertices 
that represent so-called ’’join” and ’’union” operations together with a labeling 
map t : V° ^ {0,1} such that for all [a;, y] S E{G) it holds that t(lca(a;, y)) = 1, 
and t{v) ^ t{wi) for all v G and all children wi,...,Wk G U° of u, (cf. [8]). 

Cograph k-Decomposition and Partition, and Cotree Representation. Let G = 
{V,E) be an arbitrary graph. A decomposition II = {Ei,... Ek} of A is a 
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called (cograph) k-decomposition, if each subgraph Gi = {V,Ei), 1 < i < k 
of G is a cograph. We call 11 a (cograph) k-partition if Ei fl Ej = 0, for all 
distinct i,j G {1,..., k}. A fc-decomposition 11 is called optimal, if If has the 
least number k of elements among all cograph decompositions of G. Clearly, for 
a cograph only k-decompositions with k = 1 are optimal. A /c-decomposition 
n = {El,... Ek} is coarsest, if no elements of If can be unified, so that the 
resulting decomposition is a cograph ^-decomposition, with I < k. In other 
words, n is coarsest, if for all subsets / C fc} with |/| > 1 it holds 

that {V,Lii^iEi) is not a cograph. Thus, every optimal fc-decomposition is also 
always a coarsest one. 

A graph G = (V, E) is represented by a set of cotrees T = {Ti,..., T^}, each 
Ti with leaf set V, if and only if for each edge [x, y] G E there is a tree Ti gT 
with t{\ca,Ti{x,y)) = 1. 

The Cartesian (Graph) Product GDi/ has vertex set V (GDi/) = V (G) x V(H); 
two vertices {gi,hi), {g 2 ,h 2 ) are adjacent in GDH if [ 31 , 32 ] G E{G) and hi = 
^ 2 , or [hi, ^ 2 ] G E{E[) and 31 = 32 - It is well-known that the Cartesian product 
is associative, commutative and that the single vertex graph Ki serves as unit 
element [H]. Thus, the product of arbitrary many factors Gi,..., G„ 

is well-defined. For a given product we define the G^-layer G™ of G 

(through vertex w that has coordinates (wi,... ,Wn)) as the induced subgraph 
with vertex set V{Gf) = {v = [vi,... ,Vn) G x(L^I^(Gi) | Vj = Wj, for all j 
i}. Note, G™ is isomorphic to Gi for all 1 < i < n, w G V{D)'^iGi). The n-cube 
Qn is the Cartesian product 

3 Symbolic Ultrametrics 

Symbolic ultrametrics and respective representations as event-labeled trees, 
have been first characterized by Bocker and Dress [4]. 

Theorem 3.1 ([H [T3] ). Suppose 5 \ V x V —>■ M® is a map. Then there 
is a symbolic representation of 5 if and only if 6 is a symbolic ultrametric. 
Furthermore, this representation can be computed in polynomial time. 

Let 5 ■. V xV ^ M® be a map satisfying Properties (UO) and (Ul). For 
each fixed m G M, we define an undirected graph Gm ■= Gm{S) = {V, Em) with 
edge set 

Em = {{x,y} \ S{x,y) = m, x,y gV} . (2) 

Thus, the map S can be considered as an edge coloring of a complete graph , 
where each edge [x, y] obtains color 5{x, y). Hence, Gm denotes the subgraph of 
the edge-colored graph K\v\i that contains all edges colored with m G M. The 
following result establishes the connection between symbolic ultrametrics and 
cographs. 

Theorem 3.2 ([IS])- Let 6 : V xV ^ M® be a map satisfying Properties (UO) 
and (Ul). Then 6 is a symbolic ultrametric if and only if 
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(U2’) For all {x,y,z} G (^) there is an m G M such that Em contains two of 
the three edges {x,y}, {x^z}, and {y,z}. 

(US’) Gm is a cograph for all m G M. 

Assume now, we have given an arbitrary subgraph G = (V, E) C K\y\. Let S 
be a map defined on t/ x y so that edges e G E obtain a different color then the 
non-edges e G E{K^y^) \E oi G. The questions then arises, whether such a map 
fulfilling the properties of symbolic ultrametric can be defined and thus, if there 
is tree representation (T, t) of G. Of course, this is possible only if i5 restricted 
to E, resp., is a symbolic ultrametric, while it also a symbolic ultrametric on 
the complete graph = {V,EU E‘^). The next theorem answers the latter 
question and, in addition, provides a new characterization of cographs. 

Theorem 3.3. Let G = (V,E) be an arbitrary (possibly disconnected) graph, 
W = {{x,y) G V X V \ [x,y] G E} and W” = {ix,y) G V x V \ [x,y\ ^ E}. 
There is a symbolic ultrametric 5 :V xV ^ M® s.t. 5i(W) fl S{W'^) =% if and 
only if G is a cograph. 

Proof. First assume that G is a cograph. Set S{x,x) = © for all x G F and 
set 5(x,y) = S{y,x) = 1 if [x,y\ G E and, otherwise, to 0. Hence, condition 
(170) and (C/1) are fulfilled. Moreover, by construction \M\ = 2 and thus, con¬ 
dition (t/2') is trivially fulfilled. Furthermore, since Gi(5) and its complement 
Go(5) are cographs, (C/3') is satisfied. Theorem 13.21 implies that ^ is a symbolic 
ultrametric. 

Now, let (5 : H X H —>■ M® be a symbolic ultrametric with 5i(W)U5{W‘’) = 0. 
Assume for contradiction that G is not a cograph. Then G contains an induced 
path P 4 = a — b — c — d. Therefore, at least one edge e of this path P 4 must 
obtain a color 5{e) different from the other two edges contained in this P 4 , as 
otherwise Gg(^e)i^) is not a cograph and thus, S is not a symbolic ultrametric 
(Theorem 13.21) . For all such possible maps <5 “subdividing” this P 4 we always 
obtain that two edges of at least one of the underlying paths P 3 = a — b — c 01 
b — c—d must have different colors. W.l.o.g. assume that 6 {a,b) ^ 6 {b,c). Since 
[a, c] ^ E and 5{W) fl 5(bF“) = 0 we can conclude that 5{a,c) y/ 6 {a,b) and 
S(a,c) 7 / 6 {b,c). But then condition (C/2') cannot be satisfied, and Theorem l3.2l 
implies that S is not a symbolic ultrametric. □ □ 

The latter result implies, that there is no hope for finding a map S for a 
graph G, that assigns symbols or colors to edges, resp., non-edges such that for 
S (and hence, for G) there is a symbolic representation (T, t), unless G is already 
a cograph. In other words, every symbolic representation (T, t) for an arbitrary 
graph G (which only exists if G is a cograph) is a cotree. However, this result 
does not come as a big surprise, as a cograph G is characterized by the existence 
of a unique (up to isomorphism) cotree (T, t) representing the topology of G. 
The (decision version of the) problem to edit a given graph G into a cograph 
G', and thus, to find the closest graph G' that has a symbolic representation, 
is NP-complete [Ulllg]. In this contribution, we are interested in the following 
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Figure 1: Full enumeration of all possibilities (which we leaf to the reader), 
shows that the depicted graph has no cograph 2-decomposition. However, it 
has a cograph 3-decomposition that is highlighted by dashed-lined, dotted and 
bold edges. 


problem: What is the minimum number of cotrees that are needed to represent 
the topology of G in an unambiguous way? 

4 Cotree Representation and Cograph k- 
Decomposition 

Recollect, a graph G = (H, E) is represented by a set of cotrees T = {Ti,..., T^}, 
if and only if for each edge [a;, y] & E there is a tree Ti € T with tilcaxi (x, y)) = 1. 
Note, by dehnition, each cotree Ti determines a subset Ei = {[x,y] G E \ 
tilc^Ti {x, ?/)) = 1} of E. Hence, the subgraph (F, Ei) must be a cograph. There¬ 
fore, in order to find the minimum number of cotrees representing a graph G, 
we can equivalently ask for a decomposition H = {Ei,..., Ek} oi E so that each 
subgraph (F, Ei) is a cograph, where k is the least integer among all cograph 
decompositions of G. Thus, we are dealing with the following two equivalent 
problems. 

Problem. Cotree ^Representation 

Input: Given a graph G = (F, E) and an integer k . 

Question: Can G be represented by k cotrees? 

Problem. Cograph fc-DECOMPOSmoN 

Input: Given a graph G = (F, E) and an integer k. 

Question: Is there a cograph k-decomposition of G? 

Clearly, any cograph has an optimal 1-decomposition, while for cycles of 
length > 4 or paths P 4 there is always an optimal cograph 2-decomposition. 
However, there are examples of graphs that do not have a 2-decomposition, see 
Figure [H To derive an upper bound for the integer k s.t. there is a cograph 
fc-decomposition for arbitrary graphs, the next theorem is given. 
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Theorem 4.1. For every graph G with maximum degree A there is a cograph 
k-decomposition with \<k< A + 1 that can be computed in 0{\V\\E\ + A(\V\ + 
|i?|)) time. Hence, any graph can be represented by at most A + 1 cotrees. 

Proof. Consider a proper edge-colorings (p : E ^ {1,... ,k} oi G, i.e., an edge 
coloring such that no two incident edges obtain the same color. Any proper 
edge-coloring using k colors yields a cograph fc-partition 11 = {Ei,... ,Ek} where 
Ei = {e € E \ p(e) = *}, because any connected component in Gi = {V, Ei) is an 
edge and thus, no P 4 ’s are contained in Gi. Vizing’s Theorem [25] implies that 
for each graph there is a proper edge-coloring using k colors with A < fc < A + 1. 

An proper edge-coloring using at most A + 1 colors can be computed with 
the Misra-Gries-algorithm in Od + llAl) time [50]. Since the (at most A + 1) 
respective cotrees can be constructed in linear-time 0(| + | + |A|) [^, we derive 
the runtime OdyilAl + A(|F| + lAD). □ □ 

Obviously, any optimal fc-decomposition must also be a coarsest k- 
decomposition, while the converse is in general not true, see Fig(2] The partition 
n = {El,... ,Ek} obtained from a proper edge-coloring is usually not a coarsest 
one, as possibly {V, Ej) is a cograph, where Ej = UigjAi and J C 
A graph having an optimal cograph A-decomposition is shown in Fig. [T] Thus, 
the derived bound A + 1 is almost sharp. Nevertheless, we assume that this 
bound can be sharpened: 

Conjecture 1. For every graph G with maximum degree A there is a cograph 
A-decomposition. 

However, there are examples of non-cographs containing many induced P^s 
that have a cograph fc-decomposition with fc +; A + 1, which implies that any 
optimal fc-decomposition of those graphs will have significantly less elements 
than A + 1, see the following examples. 

Example 1. Consider the graph G = (+, E) with vertex set V = {1,..., fc} U 
{a, 6} and E = {[i,j] \ i,j € {l,...,k},i ^ j} U {[/c, a], [a, &]}. The graph 
G is not a cograph, since there are induced P 4 ’s of the form i — k — a — b, 
iG{l,...,fc—1}. On the other hand, the subgraph H = (V, £'\ {[A:, a]}) has two 
connected components, one is isomorphic to the complete graph Kk on k vertices 
and the other to the complete graph K 2 . Hence, H is a cograph. Therefore, G 
has a cograph 2-partition {E\ {[fc, a]}, {[/c, a]}}, independent from k and thus, 
independent from the maximum degree A = k. 

Example 2. Consider the 2n-dimensional hypercube Q^n = (F,A) with max¬ 
imum degree 2n. We will show that this hypercube has a coarsest cograph 
n-partition H = {Ei,... ,En}, which implies that for any optimal cograph k- 
decomposition of Q 2 n we have k < A/2. 

We construct now a cograph n-partition of Q 2 n- Note, Q 2 n = 0'f2iK2 = 
□/^]^(A' 2 niF 2 ) = In order to avoid ambiguity, we write o,s 

Hi ~ <52 nnd assume that Q 2 has edges [0,1], [1,2], [2,3], [3,0]. 
The cograph n-partition of Q 2 n is defined as H = {Ei,..., En}, where Ei = 


7 



9 O OOO OOO OOO 0^0 0^00 


Figure 2: The shown (non-co)graph G has a 2-decomposition 11 = {Ei,E 2 }. 
Edges in the different elements Ei and E 2 are highlighted by dashed and solid 
edges, respectively. Thus, two cotrees, shown in the lower part of this picture, 
are sufficient to represent the structure of G. The two cotrees are isomorphic, 
and thus, differ only in the arrangement of their leaf sets. For this reason, we 
only depicted one cotree with two different leaf sets. Note, G has no 2-partition, 
but a coarsest 3-partition. The latter can easily be verified by application of 
the construction in Lemma 14.21 


Uv^vE{H^). In other words, the edge set of all Hi-layers in Q 2 n constitute a 
single class Ei in the partition for each i. Therefore, the subgraph G = {V, Ei) 
consists of n connected components, each component is isomorphic to the square 
Q 2 - Hence, Gi = {V,Ei) is a cograph. 

Assume for contradiction that 11 = {Ei,... ,En\ is not a coarsest par¬ 
tition. Then there are distinct classes Ei, i G I C {!,...,n} such that 
Gi = (V,Ui^iEi) is a cograph. W.l.o.g. assume that 1,2 € / and let v = 
(0,..., 0) G V. Then, the subgraph HfyjHlf C Q 2 n contains a path P 4 with edges 
€ E{H'f) and [r!,a],[a, &] € E{H 2 ), where x=(l,0,... ,0), a=(0,l,0... ,0) 
and b = (0,2,0 ... ,0). By definition of the Cartesian product, there are no edges 
connecting x with a or b or v with b in Q 2 n o,nd thus, this path P 4 is induced. 
As this holds for all subgraphs Hf UiLJ (i, j € I distinct) and thus, in particular 
for the graph Gi we can conclude that classes of 11 cannot be combined. Hence 
n is a coarsest cograph n-partition. 

Because of the results of computer-aided search for n — 1-partitions and 
decompositions of hypercubes Q 2 n we are led to the following conjecture: 
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Conjecture 2. Let k gN and k > 1. Then the 2k-cube has no cograph k — 1- 
decomposition, i.e., the proposed k-partition of the hypercube Q 2 k in Example\^ 
is also optimal. 

The proof of the latter hypothesis would immediately verify the next con¬ 
jecture. 

Conjecture 3. For every k G N there is a graph that has an optimal cograph 
k-decomposition. 

Proving the last conjecture appears to be difficult. We wish to point out that 
there is a close relationship to the problem of finding pattern avoiding words, 
see e.g. 0 [a [21 [22 1 [2]: Consider a graph G = {V,E) and an ordered list 
(ei,..., Cm) of the edges Ci S E. We can associate to this list (ei,..., e^) a 
word w = {wi,... ,Wm)- By way of example, assume that we want to find a 
valid cograph 2-decomposition {Ei , if 2 } of G and that G contains an induced 
P 4 consisting of the edges ei,ej,ek. Hence, one has to avoid assignments of the 
edges ei,ej,ek to the single set Ei, resp., E 2 . The latter is equivalent to find 
a binary word (wi,..., Wm) such that (wi, Wj,Wk) yf {X, X, X), X G {0,1} for 
each of those induced Hi’s. The latter can easily be generalized to find pat¬ 
tern avoiding words over an alphabet {1,..., fc} to get a valid fc-decomposition. 
However, to the authors knowledge, results concerning the counting of fc-ary 
words avoiding forbidden patterns and thus, verifying if there is any such word 
(or equivalently a fc-decomposition) are basically known for scenarios like: If 
(pi ,. ■ .pi) G {1,..., ky (often I < 3), then none of the words w that contain 
a subword = (j)i,...pi) with ij^i = ij + 1 (consecutive letter 

positions) or ij < i^ whenever j < k (order-isomorphic letter positions) is al¬ 
lowed. However, such findings are to restrictive to our problem, since we are 
looking for words, that have only on a few, but fixed positions of non-allowed 
patterns. Nevertheless, we assume that results concerning the recognition of 
pattern avoiding words might offer an avenue to solve the latter conjectures. 

4.1 NP-completeness and NP-hardness Results 

We are now in the position to prove the NP-completeness of Cotree 2- 
Representation and Cotree 2-Decomposition. These results allow to 
show that the problem of determining whether there is cograph 2-partition is 
NP-complete, as well. 

We start with two lemmata concerning cograph 2-decompositions of the 
graphs shown in Fig. [3] and S) 

Lemma 4.2. For the literal and extended literal graph in Figure\^every cograph 
2 -decomposition is a uniquely determined cograph 2 -partition. 

In particular, in every cograph 2-partition {Ei,E 2 } of the extended literal 
graph, the edges of the triangle (0,1, 2) must be entirely contained in one Ei and 
the pending edge [6,9] must be in the same edge set Ei as the edges of the of 
the triangle. Furthermore, the edges [9,10] and [9,11] must be contained in Ej, 

* j- 
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Proof. It is easy to verify that the given cograph 2-partition {i5i, £’ 2 } in Fig. [3] 
fulfills the conditions and is correct, since G = (fy£i) and G = {¥,£ 2 ) do not 
contain induced P 4 ’s and are, thus, cographs. We have to show that it is also 
unique. 

Assume that there is another cograph 2-decomposition {Fi,F 2 }. Note, for 
any cograph 2 -decomposition {Fi,F 2 } it must hold that two incident edges in 
the triangle (0,1, 2) are contained in one of the sets Fi or £ 2 - W.l.o.g. assume 
that [ 0 , 1 ], [ 0 , 2 ] € £ 1 . 

Assume first that [1,2] ^ £ 1 . In this case, because of the paths £4 = 
6 — 2 — 0 — 1 and £4 = 2 — 0—1 — 5 it must hold that [2, 6 ], [1, 5] ^ £1 and thus, 

[2. 6 ] , [1,5] € £ 2 . However, in this case and due to the paths £4 = 6 — 2 — 1 — 4 
and 2 — 0 — 1 — 4 the edge [1,4] can neither be contained in £1 nor in £ 2 , a 
contradiction. Hence, [1,2] g £ 1 . 

Note, the square S '1256 induced by vertices 1, 2, 5, 6 cannot have all edges in 
£ 1 , as otherwise the subgraph (fy £ 1 ) would contain the induced £4 = 6 — 5 — 
1 — 0 . Assume that [1,5] G £ 4 . As not all edges S '1256 are contained in £ 1 , at 
least one of the edges [5, 6 ] and [2, 6 ] must be contained in £ 2 . If only one of the 
edges [5,6], resp., [2,6] is contained in £ 2 , we immediately obtain the induced 
£4 = 6 — 2 — 1 — 5, resp., 6 — 5 — 1 — 2 in (fy £ 1 ) and therefore, both edges 

[5.6] and [2,6] must be contained in £ 2 . But then the edge [2,7] can neither 
be contained in £1 (due to the induced £4 = 6 -1 — 2 — 7) nor in £2 (due 
to the induced £4 = 5 — 6 — 2 — 7), a contradiction. Hence, [1,5] ^ £1 and 
thus, [1,5] G £2 for any 2-decomposition. By analogous arguments and due to 
symmetry, all edges [0,3], [0,8], [1,4], [2,6], [2,7] are contained in £ 2 , but not 
in £ 1 . 

Moreover, due to the induced £4 = 7 — 2 — 6 — 5 and since [2, 6 ], [2, 7] G £ 2 , 
the edge [5,6] must be in £1 and not in £ 2 . By analogous arguments and due 
to symmetry, it holds that [3,4], [7,8] G £1 and [3,4], [7,8] ^ £ 2 . Finally, none 
of the edges of the triangle ( 0 , 1 , 2 ) can be contained in £ 2 , as otherwise, we 
obtain an induced £4 in (fy £ 2 ). Taken together, any 2-decomposition of the 
literal graph must be a partition and is unique. 

Consider now the extended literal graph in Figure [S] As this graph contains 
the literal graph as induced subgraph, the unique 2-partition of the underlying 
literal graph is determined as by the preceding construction. Due to the path 
£4 = 7 — 2 — 6 — 9 with [2,6], [2, 7] G £2 we can conclude that [6, 9] ^ £2 and thus 
[ 6 , 9] G £ 1 . Since there are induced paths £4 = 5 — 6 — 9 — y, y = 10,11 with 
[5, 6], [6, 9] G £1 we obtain that [9,10], [9,11] ^ £1 and thus, [9,10], [9,11] G £2 
for any 2-decomposition (which is in fact a 2-partition) of the extended literal 
graph, as claimed. □ 

Lemma 4.3. Given the clause gadget in Fig. 

For any cograph 2-decomposition, all edges of exactly two of the triangles in 
the underlying three extended literal graphs must be contained in one Ei and not 
in Ej, while the edges of the triangle of one extended literal graph must be in 
Ej and not in Ei, i ^ j. 
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Figure 3: Left the literal graph and right the extended literal graph with unique 
corresponding cograph 2-partition (indicated by dashed and bold-lined edges) 
is shown. 



Figure 4: Shown is a clause gadget which consists of a triangle (a, b, c) and 
three extended literal graphs (as shown in Fig.[3l) with edges attached to (a, b, c). 
A corresponding cograph 2-partition is indicated by dashed and bold-line edges. 

Furthermore, for each cograph 2-decomposition exactly two of the edges e,e' 
of the triangle {a, b, c) must be in one Ei while the other edge f is in Ej but not 
in Ei, j ^ i. The cograph 2-decomposition can be chosen so that in addition 
e,F ^ Ej, resulting in a cograph 2-partition of the clause gadget. 

Proof. It is easy to verify that the given cograph 2-partition in Fig. |4] fulfills 
the conditions and is correct, G = {V, Ef) and G = {V,E 2 ) are cographs. 

As the clause gadget contains the literal graph as induced subgraph, the 
unique 2-partition of the underlying literal graph is determined as by the con¬ 
struction given in Lemma [4. 2 1 Thus, each edge of the triangle in each underlying 
literal graph is contained in either one of the sets Ei or E 2 . Assume that edges 
of the triangles in the three literal gadgets are all contained in the same set, say 
El. Then, Lemma 14.21 implies that [9, a], [9, c], [9', a], [9', b], [9", b], [9", c] € Ei 
and none of them is contained in E 2 . Since there are induced P^s: 9 — a — b — 9", 
9' — a —c —9" and 9 — c — 6 — 9', the edges [a, 6], [a, c], [6, c] cannot be contained 
in El, and thus must be in E 2 . However, this is not possible, since then we 
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would have the induced paths P4 = 9 — a — 9' — 6 in the subgraph (F, E{) a con¬ 
tradiction. Thus, the edges of the triangle of exactly one literal gadget must be 
contained in a different set Ei than the edges of the other triangles in the other 
two literal gadgets. W.l.o.g. assume that the 2-decomposition of the underlying 
literal gadgets is given as in Fig. 01 and identify bold-lined edges with Ei and 
dashed edges with i?2- 

It remains to show that this 2-decomposition of the underlying three literal 
gadgets determines which of the edges of triangle (a, 6, c) are contained in which 
of the sets Ei and E 2 . Due to the induced path 9 — a — b — 9" and since 
[9, a], [9", b] € E 2 , the edge [a, b] cannot be contained in E 2 and thus, is contained 
in El. Moreover, if [6,c] ^ E 2 , then then there is an induced path P 4 = 
6 — 9" — c — 9 in the subgraph (F, E 2 ), a contradiction. Hence, [6, c] S E 2 and 
by analogous arguments, [a, c] G i?2- If [b,c\ ^ Ei and [a,c] ^ Ei, then we 
obtain a cograph 2-partition. However, it can easily be verified that there is 
still a degree of freedom and [a,c],[6,c] G Ei is allowed for a valid cograph 
2-decomposition. □ 

We are now in the position to prove NP-completeness of Cograph 2- 
Partition by reduction from the following problem. 

Problem. Monotone NAE 3-SAT 
Input: Given a set U of Boolean variables and a set of elauses 

ip = {Cl, ..., Cm} over U such that for all i = 1, ... ,m 
it holds that \Ci\ =3 and Ci contains no negated variables. 
Question: Is there a truth assignment to if such that in each Ci 

not all three literals are set to true? 

Theorem 4.4 ([24l[21]). Monotone NAE 3-SAT is NP-complete. 

Theorem 4.5. Cograph 2-Decomposition, and thus, Cotree 2- 
Representation is NP-complete. 

Proof. Given a graph C = {V,E) and cograph 2-decomposition {Ei,E 2 }, one 
can verify in linear time whether {V,Ei) is a cograph [9]. Hence, COGRAPH 
2-Partition G NP. 

We will show by reduction from Monotone NAE 3-SAT that Cograph 2- 
Decomposition is NP-hard. Let ip = (Ci,..., Cm) be an arbitrary instance of 
Monotone NAE 3-SAT. Each clause Ci is identified with a triangle (oi, bi, a). 
Each variable Xj is identified with a literal graph as shown in Fig. [3] (left) 
and different variables are identihed with different literal graphs. Let Ci = 
(xij, Xij, Xig) and Gij, and Ci,^ the respective literal graphs. Then, we 
extend each literal graph Ci- by adding an edge [6,9ij]. Moreover, we add to 
Gji the edges [9^,1,Oi], [9i,i, c*], to G^^ the edges [ 9 i^ 2 ,ai],[ 9 i^ 2 ,bi\, to Gj3 the 
edges [9i_3, Ci], [9i_3, bi]. The latter construction connects each literal graph with 
the triangle (oi, bi, cQ of the respective clause Ci in a unique way, see Fig. 0] We 
denote the clause gadgets by 4*^ for each clause Ci. We repeat this construction 
for all clauses Ci of ip resulting in the graph 4'. An illustrative example is given 
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Xi 


X2 xs Xi x^ xe 


Figure 5: Shown is the graph 'I' as constructed in the proof of Theorem 14.51 
In particular, reflects the NAE 3-SAT formula ijj = {Ci, C 2 , C 3 } with clauses 
Cl = (xi, X 4 , X 2 ), (72 = (x 2 ,X 3 ,X 4 ) and C 3 = (a; 4 , xs, xe). Different literals ob¬ 
tain the same truth assignment true or false, whenever the edges of the triangle 
in their corresponding literal gadget are contained in the same set Ei of the 
cograph 2 -partition, highlighted by dashed and bold-lined edges. 


in Fig. [SJ Clearly, this reduction can be done in polynomial time in the number 
m of clauses. 

We will show in the following that 4' has a cograph 2-decomposition (resp., 
a cograph 2 -partition) if and only if '0 has a truth assignment /. 

Let 0 = {Cl ,..., Cm) have a truth assignment. Then in each clause Ci at 
least one of the literals Xi ^, Xi ^, Xi^ is set to true and one to false. We assign all 
edges e of the triangle in the corresponding literal graph Ci- to Ei, if f{xi^) = 
true and to E 2 , otherwise. Hence, each edge of exactly two of the triangles (one 
in Gi^ and one in Gi^, contained in one Ey and not in Eg, while the edges of 
the other triangle in Gi .„, j" ^ j,j' are contained in Eg and not in Er, r ^ s, 
as needed for a possible valid cograph 2-decomposition iLemma 14.31) . We now 
apply the construction of a valid 2 -decomposition (or 2 -partition) for each as 
given in Lemma [4.31 starting with the just created assignment of edges contained 
in the triangles in Gi^, Ci., and Gi.„ to Ei or A 2 . In this way, we obtain a 
valid 2-decomposition (or 2-partition) for each subgraph (fi of 41. Thus, if there 
would be an induced P4 in 4' with all edges belonging to the same set Er, then 
this P4 can only have edges belonging to different clause gadgets 4'fc,4>;. By 
construction, such a P4 can only exist along different clause gadgets 4'^ and 4>; 
only if Ck and Ci have a literal Xi = Xk^ = in common. In this case, Lemma 
l4.3l imDlies that the edges [ 6 , 9k,m] and [ 6 , 9i,n] in 4^^ must belong to the same set 
Er- Again by Lemma[431 the edges [9k,mjy] and [9k,m,y'], V,y' £ {ak,bk,Ck} 
as well as the edges y] and [9i,n, y'], y, y' £ {o/, 6 /, c/} must be in a different 
set Eg than [ 6 , and [ 6 , 9;,„]. Moreover, respective edges [5,6] in 4'^, as 
well as in 4'/ (Fig. |3|) must then be in Er, i.e., in the same set as [ 6 , 9k,m] and 
[ 6 , 9 i,n]- However, in none of the cases it is possible to find an induced P4 with 
all edges in the same set Er or Eg along different clause gadgets. Hence, we 
obtain a valid cograph 2-decomposition, resp., cograph 2-partition of 4'. 

Now assume that 4' has a valid cograph 2-decomposition (or a 2-partition). 
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Any variable Xi^ contained in some clause Ci = {xi^, Xi^, Xi^) is identified with 
a literal graph Gi^. Each clause Ci is, by construction, identified with exactly 
three literal graphs Gi^, Gi^, Gi^, resulting in the clause gadget 4'i. Each literal 
graph Gij. contains exactly one triangle tj. Since 'I'i is an induced subgraph of 
4', we can apply Lemma l4.3l and conclude that for any cograph 2-decomposition 
(resp., 2-partition) all edges of exactly two of three triangles are con¬ 

tained in one set E^, but not in Es, and all edges of the other triangle are 
contained in Es, but not in Er, s ^ r. Based on these triangles we define a 
truth assignment / to the corresponding literals: w.l.o.g. we set f{xi) =true 
if the edge e € is contained in Ei and f{xi) =false otherwise. By the lat¬ 
ter arguments and Lemma 14.31 we can conclude that, given a valid cograph 
2 -partitioning, the so defined truth assignment / is a valid truth assignment of 
the Boolean formula ip, since no three different literals in one clause obtain the 
same assignment and at least one of the variables is set to true. Thus, Cograph 
2-Decomposition is NP-complete 

Finally, because Cograph 2-Decomposition and Cotree 2- 
Representation are equivalent problems, the NP-completeness of Cotree 
2-Representation follows. □ 

As the proof of Theorem 14.51 allows us to use cograph 2-partitions in all 
proof steps, instead of cograph 2-decompositions, we can immediately infer the 
NP-completeness of the following problem for k=2, as well. 

Problem. Cograph /c-Partition 
Input: Given a graph G = (V, E) and an integer k. 

Question: Is there a Cograph k-Partition of G? 

Theorem 4.6. Cograph 2-Partition is NP-complete. 

As a direct consequence of the latter results, we obtain the following theorem. 

Theorem 4.7. Let G he a given graph that is not a cograph. The following three 
optimization problems to find the least integer k > 1 so that there is a Cograph 
k-Partition, or a Cograph k-Decomposition, or a Cotree k-Representation for 
the graph G, are NP-hard. 
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